DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
Abstract
We introduce DeQue, a lexicon covering French complex prepositions (CPRE) like à partir de (from) and complex conjunctions (CCONJ) like bien que (although). The lexicon includes fine-grained linguistic description based on empirical evidence. We describe the general characteristics of CPRE and CCONJ in French, with special focus on syntactic ambiguity. Then, we list the selection criteria used to build the lexicon and the corpus-based methodology employed to collect entries. Finally, we quantify the ambiguity of each construction by annotating around 100 sentences randomly taken from the FRWaC. In addition to its theoretical value, the resource has many potential practical applications. We intend to employ DeQue for treebank annotation and to train a dependency parser that takes complex constructions into account.
Similar resources
PrepLex: A Lexicon of French Prepositions for Parsing
PrepLex is a lexicon of French prepositions which provides all the syntactic information needed for parsing. It was built by comparing and merging several authoritative lexical sources. This lexicon also includes information about the prepositions or classes of prepositions that appear in French verb subcategorization frames. This resource has been developed as a first step in making current Fr...
Ontology and Lexical Semantics for Generating Temporal Discourse Markers
In text, temporal relations between events can be signalled in several ways; among them are specific lexical items, here called temporal discourse markers. We analyse the semantics of about 20 German subordinating conjunctions and prepositions and transfer these findings to a sentence generation framework that uses a dedicated discourse marker lexicon for producing complex sentences. After discuss...
Attacking Parsing Bottlenecks with Unlabeled Data and Relevant Factorizations
Prepositions and conjunctions are two of the largest remaining bottlenecks in parsing. Across various existing parsers, these two categories have the lowest accuracies, and mistakes made have consequences for downstream applications. Prepositions and conjunctions are often assumed to depend on lexical dependencies for correct resolution. As lexical statistics based on the training set only are ...
Joint Dependency Parsing and Multiword Expression Tokenization
Complex conjunctions and determiners are often considered as pretokenized units in parsing. This is not always realistic, since they can be ambiguous. We propose a model for joint dependency parsing and multiword expression identification, in which complex function words are represented as individual tokens linked with morphological dependencies. Our graph-based parser includes standard secondo...
Towards Invariant Meanings Of Spatial Prepositions And Preverbs
This work presents a semantic analysis of two spatial prepositions and their associated prefixes: the French sur, sur- (on) and the Polish przez, prze- (across). We propose a theory of abstract places (loci) as a method of description that helps to build invariant meanings for the two linguistic units. 1 Introduction Natural languages encode spatial and temporal representations in many var...